Reinforcement theory

Results: 290



#Item
161The International Journal of Robotics Research http://ijr.sagepub.com/ Learning variable impedance control Jonas Buchli, Freek Stulp, Evangelos Theodorou and Stefan Schaal

The International Journal of Robotics Research http://ijr.sagepub.com/ Learning variable impedance control Jonas Buchli, Freek Stulp, Evangelos Theodorou and Stefan Schaal

Add to Reading List

Source URL: www-clmc.usc.edu

Language: English - Date: 2011-05-27 13:37:50
162Robot Learning with State-Dependent Exploration Thomas R¨uckstieß, Martin Felder, Frank Sehnke, J¨urgen Schmidhuber Abstract— Policy gradient algorithms are among the few learning methods successfully applied to dem

Robot Learning with State-Dependent Exploration Thomas R¨uckstieß, Martin Felder, Frank Sehnke, J¨urgen Schmidhuber Abstract— Policy gradient algorithms are among the few learning methods successfully applied to dem

Add to Reading List

Source URL: www6.in.tum.de

Language: English - Date: 2013-05-15 08:23:46
163The Study of Political Campaigns Henry E. Brady, Richard Johnston, and John Sides D

The Study of Political Campaigns Henry E. Brady, Richard Johnston, and John Sides D

Add to Reading List

Source URL: www.press.umich.edu

Language: English - Date: 2012-11-26 13:55:47
164The Benefits of a Behavioral Approach to Safety:

The Benefits of a Behavioral Approach to Safety:

Add to Reading List

Source URL: www.safetyperformance.com

Language: English - Date: 2011-12-09 10:46:01
165Parametric Policy Gradients for Robotics Frank Sehnke, Thomas R¨uckstieß, Martin Felder and J¨urgen Schmidhuber Abstract— Slow convergence is a major problem for policy gradient methods. It is a consequence of the f

Parametric Policy Gradients for Robotics Frank Sehnke, Thomas R¨uckstieß, Martin Felder and J¨urgen Schmidhuber Abstract— Slow convergence is a major problem for policy gradient methods. It is a consequence of the f

Add to Reading List

Source URL: www6.in.tum.de

Language: English - Date: 2013-05-15 08:23:49
166Copyright by David Merrill Pardoe 2011  The Dissertation Committee for David Merrill Pardoe

Copyright by David Merrill Pardoe 2011 The Dissertation Committee for David Merrill Pardoe

Add to Reading List

Source URL: apps.cs.utexas.edu

Language: English - Date: 2011-06-06 08:34:50
167A	
  Dynamic	
  Data	
  Driven	
  Cogni1ve	
   Control	
  for	
  Explora1on/Exploita1on	
   Jose	
  Principe	
  and	
  Panos	
  Pardalos	
   University	
  of	
  Florida	
   	
   principe@cnel.ufl.edu

A  Dynamic  Data  Driven  Cogni1ve   Control  for  Explora1on/Exploita1on   Jose  Principe  and  Panos  Pardalos   University  of  Florida     principe@cnel.ufl.edu

Add to Reading List

Source URL: www.dddas.org

Language: English - Date: 2013-10-28 19:45:27
168Policy Gradients with Parameter-Based Exploration for Control Frank Sehnke1 , Christian Osendorfer1 , Thomas R¨ uckstieß1 , 1 3

Policy Gradients with Parameter-Based Exploration for Control Frank Sehnke1 , Christian Osendorfer1 , Thomas R¨ uckstieß1 , 1 3

Add to Reading List

Source URL: www6.in.tum.de

Language: English - Date: 2013-05-15 08:23:49
169Risk, Reinforcement, Retention in Treatment, and Reoffending for Boys and Girls in Multidimensional Treatment Foster Care  DANA K. SMITH

Risk, Reinforcement, Retention in Treatment, and Reoffending for Boys and Girls in Multidimensional Treatment Foster Care DANA K. SMITH

Add to Reading List

Source URL: www.mtfc.com

Language: English - Date: 2008-09-30 12:59:10
170PALADYN Journal of Behavioral Robotics  Review Article · DOI: [removed]s13230[removed] · JBR · 1(1) · 2010 · 14-24 Exploring Parameter Space in Reinforcement Learning

PALADYN Journal of Behavioral Robotics Review Article · DOI: [removed]s13230[removed] · JBR · 1(1) · 2010 · 14-24 Exploring Parameter Space in Reinforcement Learning

Add to Reading List

Source URL: www6.in.tum.de

Language: English - Date: 2013-05-17 09:57:48